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Measuring Teacher and 
Leader Performance 

Cross-Sector Lessons for Excellent Evaluations 

By Julie Kowal and Emily Ayscue Hassel 


EXECUTIVE SUMMARY 


F or too long, performance measurement sys- 
tems in education have failed to document 
and recognize real differences among educa- 
tors. But a recent national push to use performance 
evaluations for critical personnel decisions has high- 
lighted the shortcomings of our current systems and 
increased the urgency to dramatically improve them. 
As state and local education leaders reform teacher 
and principal evaluation systems, they can draw from 
decades of performance measurement research and 
experience in other sectors to develop more accurate, 
reliable, and meaningful information about educa- 
tors’ performance. 

In this report, we summarize six steps that re- 
search and experience from across sectors — includ- 
ing government agencies, nonprofit organizations, 
and for-profit companies — show are critical for 
designing an outstanding performance measurement 
system: 

i. Determine the purposes of performance 
measurement, such as informing professional 
development, promotions, compensation, reten- 
tion, and dismissals. Engaging top leadership in 
conversation about these purposes helps ensure 
that performance measurement systems provide 
the type and quality of information necessary to 
guide each decision. 


2. Choose job objectives that align with the 
organization’s mission to ensure that perfor- 
mance measures and the measurement process 
capture the critical outcomes and behaviors 
needed from each employee to achieve the 
school’s, district’s, or education provider’s 
mission. 

3. Design performance measures, including what 
individuals in each role are expected to contrib- 
ute and the ways in which they are expected to 
achieve results. By choosing the right measures, 
organizations clarify and stimulate staff actions 
that contribute to success. 

4. Set performance standards to use as a yard- 
stick for assessing employees’ performance, so 
that both leaders and staff know what good 
and great performance looks like. 

5. Design the performance measurement 
process by determining who will organize and 
have input into evaluations, using what process, 
and how often. 

6. Use measurement results to take action, in- 
cluding making decisions about professional 
development; promotions and reach extension; 
career planning; compensation; retention and 
dismissals; and future recruiting and hiring. 
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For each of these steps, we share critical research 
findings and examples from other sectors to guide 
the design and implementation of performance 
measurement systems in education. Many of these 
lessons echo elements that reformers have recently 
called for in education , 1 such as: 

> Measuring both the “what” and “how” of educa- 
tors’ performance. Cross-sector research reveals 
clear advantages to including in employees’ evalu- 
ations measures of ultimate results as well as the 
activities that contribute to those results. Com- 
bining both student results and educators’ skills 
and competencies will increase the developmental 
value of evaluations and lay the groundwork for 
better-informed decisions about compensation, 
promotion, and dismissal. 

> Conquering fears of measuring student learn- 
ing. District and state leaders must also consider 
data about teachers’ and leaders’ contributions to 
student progress. After all, schools are in the busi- 
ness of learning — and students’ academic growth 
is the primary outcome. Rather than fear student 
learning results, we must measure them — and 
commit to improve our measurement over time 
rather than using imperfections as an excuse to 
resist such data entirely. 

> Using performance measurement results. Im- 
proving our performance measurement systems 
will have no effect unless the results are put to 
use. States, districts, and other providers must 
commit to act upon real differences in educator 
effectiveness — by using evaluations as the basis 
for teachers’ and principals’ ongoing development 
and pay, decisions about retention and dismissal, 
and future recruiting and selection. 

These are commonplace, well-traveled approaches 
in other sectors. Indeed, there are few compelling 
reasons or existing barriers to prevent us from imple- 
menting them immediately in education. In the final 
section of this report, we put these lessons to use by 
providing sample elements of a strong performance 
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measurement system for teachers, and a detailed ex- 
ample of a method for measuring the “how” of their 
performance. 

Research and experience from other sectors also 
suggest several ways in which successful organiza- 
tions have already catapulted past even the boldest 
calls for reform in education to refine their measure- 
ment systems and obtain the most impact, such as by: 

> Adopting strategies to protect against leniency 
and bias. Tactics include rating on a forced dis- 
tribution, holding managers accountable for the 
appraisals they conduct, including reviews from 
multiple perspectives with different weights, and 
providing training in the performance measure- 
ment process. 

> Meaningfully assessing team performance. 
Where individuals’ contributions occur at the 
team level, great organizations get smart about 
assessing performance in that context. Methods 
include identifying individual contributions to 
team results using peer evaluation and statistical 
analysis, and assessing team outcomes alongside 
individual results. 

>• Measuring more to measure better. Districts, 
schools, and other providers that are committed 
to improving the performance of professionals 
will measure fearlessly and frequently. With a 
simultaneous commitment to use the lessons and 


measurement tools available today and improve 
them in the future, we can rapidly advance perfor- 
mance measurement — and more quickly contrib- 
ute to improved student outcomes. 

> Rapidly adjusting measures that do not predict 
success. Great organizations change their mea- 
sures, and measurement processes, to continue 
making progress and to reflect changes in how 
work is done. Education must do no less. 

Leaders of successful organizations in other indus- 
tries have long recognized that talent — and strategic 
management of that talent — is the key to their 
competitiveness and success. There is no sector where 
talent is more important than education. We must 
carefully consider lessons learned from decades of 
performance measurement in other sectors, both to 
inform our current efforts and to spur the next gen- 
eration of reform. 

The steps and decisions involved in performance 
measurement form the culture in our schools, which 
in turn determines who decides to teach, how long 
they stay, what students they reach, and how moti- 
vated they feel to raise our children’s sights and 
conquer obstacles. Measuring educators’ perfor- 
mance accurately — and in ways that enable crucial 
decisions — must therefore be a top priority for re- 
form within the next half-decade. 
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Introduction 


O ur failure to meaningfully differentiate 
teachers’ and principals’ impact on stu- 
dent learning is finally beginning to re- 
ceive the attention it deserves . 2 For too long, perfor- 
mance measurement systems in education have not 
documented real differences among educators — in 
their roles, strengths, weaknesses, or overall effec- 
tiveness in improving student learning. However, a 
recent national push to use performance evaluations 
for critical personnel decisions has highlighted the 
shortcomings of our current systems and increased 
the urgency to dramatically improve them. 

How can we make important decisions about 
teacher and principal assignments, rewards, reten- 
tion, or dismissal when we cannot reliably recognize 
differences in effectiveness and impact? How can we 
offer effective, personalized professional development 
without accurate and credible information about 
educators’ individual strengths and weaknesses? 
Faced with these types of questions, many federal. 


As education shifts toward a more 
performance-focused culture, improving 
the measurement of teacher and principal 
effectiveness will be a critical step. 


state, and district education leaders have begun to 
realize that our current performance measurement 
systems not only contribute to a depersonalization 
of careers in education, but also limit our ability to 
improve student learning. 

We therefore have a great opportunity. As edu- 
cation shifts toward a more performance -focused 
culture, improving the measurement of teacher and 
principal effectiveness will be a critical step. With ac- 
curate, reliable, and meaningful information about 
educators’ performance, states, districts, and other 
providers will be able to better differentiate between 
“effective” and “ineffective” teachers. Accurate and 
complete measurement will enable better decisions 
about future recruiting, professional development, 
career paths, pay, retention, and dismissal. 

With better information about educators’ perfor- 
mance, the education sector can become a profession 
in which excellence is recognized and used to help 
more students achieve success. 
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Guideposts from Other Sectors 

Lessons about Effective 
Performance Measurement 


ortunately, decades of performance mea- 
surement research and experience in sectors 
outside education — including government 
agencies, nonprofit organizations, and for-profit 
companies — provide strong guideposts for our work 
in education. Together with a growing knowledge 
base in the school setting, these cross-sector lessons 
can help motivated education leaders make better, 
faster improvements in performance measurement 
for educators. 

In this research summary, we distill the lessons 
into six elements that cross-sector research and expe- 
rience show are critical for designing an outstanding 
performance measurement system. While the report 
is not a how-to toolkit, we organize these elements 
into a series of six steps for effective performance 
measurement: 

1. Determine the purposes of performance 
measurement, such as informing professional 
development; promotions and reach extension; 
career planning; compensation; retention and 
dismissals; and future recruiting and hiring. 

The purposes of a performance appraisal system 
should determine the rest of its design, includ- 
ing performance goals, measures, and the ap- 
praisal process. 

2. Choose objectives that align with the orga- 
nization’s mission to ensure that performance 
measures and the measurement process capture 
the critical outcomes and behaviors needed 
from each employee to achieve the school’s, 
district’s, or provider’s mission. 


3. Design performance measures, including what 
individuals in each role are expected to contrib- 
ute and the ways in which they are expected to 
achieve results. By choosing the right measures, 
organizations stimulate staff actions that lead to 
success. 

4. Set performance standards against which 
employees’ performance will be evaluated by 
defining what a fully satisfactory performance 
looks like. This step also includes decisions 
about whether to use absolute or relative stan- 
dards, how to account for changing circum- 
stances, and the scales on which each objective 
will be rated. 

5. Adopt a performance measurement process 
by determining who will organize and have 
input into evaluations, using what process, and 
how often. 

6. Use measurement results to take action, in- 
cluding making decisions about professional 
development; promotions and reach extension; 
career planning; compensation; retention and 
dismissals; and future recruiting and hiring. 

In the final section of the report, we put these les- 
sons to use by providing sample elements of a strong 
performance measurement system for teachers, and 
a detailed example of measuring the “how” of their 
performance. We also recommend several critical 
considerations for leaders in K-12 education as they 
overhaul their own measurement systems to foster a 
strong performance culture in our public schools. 
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Step One: Identify the purposes of 
performance appraisals 

The first step in developing an effective performance 
measurement system is to identify the purposes it 
will serve in the organization, and how the results 
will be used. 3 Most evaluation systems are designed 
to serve multiple purposes, which require different 
types of information: for example, decisions about 
promotion require the ability to make comparisons 
among employees, while development and training 
require a deep understanding of individual strengths 
and weaknesses. 4 Similarly, appraisals aimed at de- 
velopment may be best conducted separately from 
those for compensation, because employees may be 
less likely to embrace feedback if they believe it is tied 
to their paycheck. 5 The most common purposes of 
performance measurement include: 

> Feedback and development — to provide infor- 
mation about an employee’s performance indi- 
vidually and as a team member, so she is aware of 
her own strengths and weaknesses, and can focus 
future work on developing both. 6 In addition, per- 
formance appraisals tell the organization where 
to focus organization-wide and individualized 
professional development to improve employees’ 
skills, competencies, and overall performance. 7 

> Promotions and career planning — to identify 
employees whose performance warrants addi- 
tional opportunities and responsibilities, formal 
promotion, or a lateral career move to use and 
build on strengths; and to keep people focused on 
additional ways to achieve success and advance 
their long-term careers. 8 

> Compensation — to recognize, reward and mo- 
tivate employees through pay awards or salary in- 
creases. In the large number of organizations that 
use differentiated compensation based on perfor- 
mance, data gathered through the performance 
measurement process are typically the predomi- 
nant factor in pay decisions. 9 

> Retention and dismissals — to inform fair, 
reliable, and data-based decisions about which 


employees to keep, which require rapid improve- 
ment, and which need help seeking opportunities 
elsewhere. 10 

> Future recruiting and hiring — to correlate 
behaviors, competencies, and skills with per- 
formance among current employees, thereby 
strengthening future decisions about which can- 
didates to recruit and hire. 11 In addition, strong 
performance measurement systems that are linked 
to rewards have been shown to attract more high- 
performing and better qualified candidates into 
an organization’s talent pool, creating a virtuous 
cycle. 12 Good measurement alone is not enough. 
Prospective candidates must see that it is used to 
reward better performers. 

Engaging top leadership in conversation and deliber- 
ation about these varying purposes during the initial 
design of performance measurement systems helps 
ensure that the system provides the type and quality 
of information necessary to inform each decision. An 
organization’s primary goals for the system should 
direct both the content of the evaluations (discussed 
in the next section, Step Two) and the process by 
which appraisals are carried out (covered under Step 
Four). 

Step Two: Choose job objectives that align 
with the organization’s mission 

Ultimately, any performance measurement system 
aims to improve the performance of individual em- 
ployees and, as a result, improve the overall success of 
the organization. 13 To meet this goal, performance 
measurement systems should be tightly aligned with 
each employee’s expected contributions to the orga- 
nization’s core mission. 14 A 2007 study of historically 
high-performing companies examined common and 
disparate characteristics among their performance 
measurement systems. Among the authors’ key find- 
ings was that each company had worked to make its 
system consistent and aligned with its particular cul- 
ture and mission, to make performance measurement 
and the organization’s goals mutually reinforcing. 15 
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The second critical step successful organizations 
take in designing a strong performance measure- 
ment system, therefore, is to clarify how each job or 
job category contributes to the organization’s overall 
mission. This helps ensure that performance objec- 
tives capture the critical contributions needed from 
individuals and teams — i.e., that they are measuring 
what matters most. 16 Given the goals and direction of 
the organization, what are employees’ roles in getting 
there? Without this step, organizations add objec- 
tives and process steps that clog the measurement 
process and miss elements of a job that are essential 
to the employee’s — and the organization’s — success. 


Clarifying how each job contributes 
to the organization’s mission ensures 
that performance appraisals measure 
what matters most. 


The types of objectives that organizations often 
set for individual employees and teams can be 
grouped into two categories (see Figure i): 17 

> The “what”: What tangible results will the em- 
ployee be expected to achieve to contribute to 
the organization’s overall goals? Honing quality, 
managing costs, increasing sales, and improving 
speed and innovation — and customer perception 
of the resulting benefits — are common objectives 
in the private sector that lead to organization-level 
success. The relative importance of these objec- 
tives depends on the organization’s particular 
strategy for success. Survey evidence suggests 
that the objectives that are most easily turned 
into quantifiable measures predominate in many 
organizations. 18 

> The “how”: What skills, knowledge, and compe- 
tencies enable the employee to achieve desired re- 
sults? Objectives of this kind may include achiev- 


figure i. Two Dimensions of Employees’ Performance 



HOW 


Skills 

Competencies 

Knowledge 


ing target levels or improving on a particular skill; 
improving technical or process knowledge; or 
demonstrating patterns of thinking and behavior 
that affect an employee’s ability to achieve results, 
such as his drive for achievement or ability to in- 
fluence others. 19 


To serve the multiple purposes outlined above, most 
evaluations focus on both types of objectives. 20 Re- 
search from across sectors suggests that the best types 
of both objectives will incorporate several key char- 
acteristics. Performance objectives should be: 

> Designed with employee input. As suggested 
under Step One, top leadership in an organization 
must participate in the design of the performance 
evaluation system and its purposes. In two-thirds 
of Fortune ioo companies (the ioo top-ranked 
U.S. corporations according to gross revenue), 
decisions about performance appraisal strategy 
are made by top executives, including whether or 
not to have appraisals, and the purposes for which 
they are used. 21 

Just as important, however, is the engagement 
of employees in the selection of performance ob- 
jectives and, to the extent possible, how they will 
be measured against a standard (discussed further 
under steps Three and Four). Studies of employees 
across sectors suggest that involving them both in 
goal setting and feedback can lead to higher levels 
of acceptance of the appraisal system, higher mo- 
tivation, and, as a result, increased productivity. 22 
More than 80 percent of Fortune ioo companies 
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involve executives, managers, and professional em- 
ployees in setting their own objectives. 23 

V Clear and transparent. Clear and transparent 
objectives help explain why each element of per- 
formance is important and how it fits into the 
firm’s larger goals. They also help communicate to 
employees exactly what will be expected of them. 
By visibly aligning organizational purpose and 
individual performance objectives, organizations 
can make employees feel more empowered and 
motivated to deliver expected results. 24 In addi- 
tion, clear performance objectives make the evalu- 
ation process easier and more reliable, by signaling 
to raters how performance should be judged. 25 

> Reflective of differences among positions and 
roles. Just as different firms perform different 
functions, departments within an organization 
— and employees within those departments — 
play different roles. Sometimes, employees have 
standard jobs with performance expectations that 
are fairly consistent over time and across many 
people. In other cases, individuals take on roles 
crafted to fit the firm’s current needs and the indi- 
vidual’s unique skill combination. In both cases, 
performance objectives should reflect the actual 
jobs and roles people have in helping the firm meet 
its goals. As Dick Grote explains in his seminal 
Complete Guide to Performance Appraisal (1996), 
“the behaviors, skills and competencies that make 
an individual a good sales person are likely to be 
different from those that make for a good clerical 
employee, just as the attributes of an ideal clerk 
may differ from those of an ideal supervisor or en- 
gineer.” The same can be said for varying jobs and 
roles within a school. 

Indeed, multiple researchers in psychology have 
found that employees are most likely to be satis- 
fied with ratings that they believe reflect their per- 
formance and contributions to the organization. 
Building this type of rating, however, requires 
some level of differentiation in how employees 
within various roles are evaluated. 26 Differentia- 
tion may be as simple as adjusting the specific 


objectives for different positions, or could require 
evaluating individuals with varying measures in 
accordance with their role. 27 

>• Limited in number. How many objectives is it 
reasonable to include for a particular employee’s 
performance appraisal? If the objectives are truly 
critical to the organization’s success, an individual 
can probably accomplish only a limited number 
over the course of several months or a year. While 
there is no hard rule, few successful organizations 
expect an employee to focus on less than a hand- 
ful of objectives, or to achieve more than a dozen 
during a single appraisal period. With too few 
objectives, the organization is probably not re- 
quiring enough from the employee, or is defining 
objectives too broadly. With too many, it becomes 
more likely that the employee and the employer 
will begin to emphasize some and neglect others 
over time. 28 The most important consideration is 
to concentrate each employee within the organi- 
zation on the most critical outcomes and actions 
that will positively affect the success of the firm as 
a whole. 


How many objectives is it reasonable to 
include in a performance appraisal? If 
they are truly critical to the organization’s 
success, an individual can probably accom- 
plish only a limited number over the course 
of several months or a year. More than a 
few and less than a dozen at one time is a 
feasible range. 


> Adjusted to meet changing goals and needs. 
Organizations with strong performance mea- 
surement systems also design processes that are 
flexible enough to be adjusted in response to 
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shifts in the organization’s mission or goals, and 
to changing employee needs. At John Deere, for 
example, changing needs in the industry in the 
1980s required the organization to shift its culture 
and change its appraisal system as well. This lead- 
ing agricultural manufacturer had long evaluated 
and rewarded employees based on individual 
performance, according to internally developed 
job standards. But when the agricultural sector 
plummeted, leaders were pressed to dramatically 
increase efficiency. A key expectation for manu- 
facturing employees became their ability to work 
together and use their ingenuity to find more pro- 
ductive work methods. John Deere therefore ad- 
justed its appraisal and reward system to focus on 
team-based outcomes and successful innovations 
in accordance with its new goals and strategy. 29 
The best performance measurement systems are 
“living” systems, enabling organization leaders to 


respond to changing needs, successes, and failures 
without designing a new system from scratch 
every few years. 

After choosing the objectives that employees are ex- 
pected to meet, based on the organization’s mission 
and each employee’s role in achievement, organiza- 
tions must next determine how progress on each 
objective will be measured. This is the focus of 
Step Three. 

Step Three: Design Performance Measures 

How do organization leaders know when an em- 
ployee has met an objective? Sometimes it is obvious, 
such as when a salesperson turns in dollar figures 
showing progress toward annual sales goals, or a 
manager reduces error rates by a certain percentage. 
But other times there is not a direct measure of an 
employee’s performance relative to a given standard, 


WWW.OPPORTUNITYCULTURE.ORG 


MEASURING TEACHER LEADER PERFORMANCE | 9 





such as when the employee is charged with improv- 
ing communications or encouraging teamwork. In 
these cases, organizations must choose measures that 
are precursors and proxies for ultimate performance 
to provide a best estimate of whether employees 
have achieved an objective. Below, we examine the 
processes by which organizations in other sectors 
make these determinations for objectives in both the 
“what” and “how” categories. 

Measures of “what.” Research suggests that 
the primary factor in the majority of performance 
measurement systems is an employee’s tangible and 
observable results. 30 Sales outcomes are the most 
obvious example of a result measure, because the 
relationship between the employee’s performance 
(selling) and the desired outcome (increasing sales) 
is so direct and the performance so easily measured 
(sales volume). Where that relationship is direct and 
straightforward, organization leaders may not think 
of choosing performance measures as a step distinct 
from choosing objectives. But other outcomes can be 
similarly measured and quantified, such as speed (the 
number of times an employee engages in a particular 
activity), output (how much product is produced 
within a given time period), or market share (the in- 
dividual’s sales revenue as a percentage of all similar 
purchases by a defined customer group). 31 

Figure 2 shows the relationship between organi- 
zational goals, employee outcomes, and measures of 
performance for a hypothetical organization, with 


related standards for performance (discussed under 
Step Four). 

Measures of “how.” In many cases, an employee 
is also expected to achieve objectives that precede or 
are necessary to obtain the ultimate result, such as 
engaging in teamwork, providing high-quality cus- 
tomer service, or persisting in the face of obstacles. 

In recent years, organizations across sectors have 
increasingly begun to use these types of measures as 
critical elements of employees’ evaluations. 32 Particu- 
larly in profit-driven industries, organization leaders 
have recognized that relying upon financial measures 
alone may undercut a company’s mission or strategy 
in the longer term, and that measures of how employ- 
ees perform their roles offer a more complete and 
predictive picture of the organization’s future viabil- 
ity and health. 33 Organizations also have found that 
purely results-based evaluation models tend to over- 
simplify what the employee does, and are not helpful 
for identifying areas for development. 34 

For example, at banking giant Citibank, finan- 
cial measures long dominated performance evalua- 
tions. In the late 1990s, however, managers realized 
that these measures failed to connect employees to 
the bank’s focus on customer service and — more 
dangerously — tended to focus employees’ energy 
on short-term targets at the expense of long-term 
success. To capture the importance of non-financial 
measures, one state-level division developed an ap- 
praisal system that also included measures of the 


figure 2. Example of Relationship Between Goals, Objectives, Measures, and Standards 


GOAL 


Increase sales 


EMPLOYEE OBJECTIVE 


Sell to new 
and existing 
customers 


MEASURE 


Cases per week 

New customers 
per month 


STANDARD 


50 cases per week 

3 new customers 
per month 


Note: Examples of goals, objectives, measures, and standards for educators appear in the final section of this report, 
on page 25. 
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Many organization leaders have 
recognized that basing performance 
appraisals upon results-based measures 
alone can undercut the organization’s 
long-term mission. Including measures 
of how an employee achieves success 
provides a more accurate picture 
of performance and helps identify 
areas for development. 


“how,” such as strategy implementation and cus- 
tomer satisfaction. Strategy implementation was 
measured via revenue and broken down by types 
of customers that the division especially wanted to 
serve, such as households, businesses, and profession- 
als. Customer satisfaction was measured through 
telephone interviews with customers who had visited 
a branch during the past month. All of these mea- 
sures could be quantified and compared with perfor- 
mance standards to result in an employee’s individual 
performance rating. 35 

Rental car agency Enterprise used a similar 
method to quantify customer satisfaction, which 
it deemed critical to ensuring repeat business, and, 
thus, its employees’ and organization’s success. 

Branch managers were evaluated in part based on 
the percentage of customers who rated their branches 
5 out of 5 on customer service. To receive compensa- 
tion bonuses or become eligible for a promotion, the 
manager’s branch was required to deliver customer 
service at or above the average feedback score for all 
comparable branches. 36 

Behavioral competencies are one subset of mea- 
sures that powerfully examine the “how” of employ- 
ees’ performance. Competencies are habitual be- 
haviors that enable people to achieve results in their 


job. 37 For example, an employee may achieve out- 
standing results only by engaging in certain behav- 
iors, habits, or skills such as teamwork, persistence, 
or flexibility. These are difficult to evaluate objec- 
tively unless increasingly effective behaviors are 
clearly written for managers, peers, customers, and 
others to score. Measuring an employee’s competen- 
cies is a critical element of performance measure- 
ment, particularly for the purposes of feedback, 
development, career planning, and future selection. 38 
Many companies across industries incorporate com- 
petencies into performance measurement so that 
they can more objectively examine the qualitative 
factors that make employees successful in their jobs. 39 

For example, Coca-Cola and Mobil Oil both 
emphasize quantitative (financial and non-financial) 
results in each employee’s performance appraisal. But 
they also identify the characteristics that they expect 
from each of their employees in achieving these re- 
sults, using managers’ assessments to evaluate com- 
petencies such as communication, problem solving, 
developing others, technical knowledge, and setting 
high standards. 40 

Measuring employees’ competencies is particularly 
powerful because it provides a fuller picture of their 
performance. Competencies correlated with job suc- 

figure 3. Competencies Commonly Used in 
Performance Measurement 


Achievement Orientation 
Initiative 

Impact and Influence 
Self-Confidence 
Directiveness /Assertiveness 
Relationship Building 
Analytical Thinking 
Conceptual Thinking 
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cess are remarkably consistent for some positions de- 
spite organizational changes and redirection. Many 
of the same competencies apply across different job 
categories. 41 Indeed, through decades of rigorous 
study, researchers have determined that it is possible 
to identify specific competencies required for success 
in common job categories, and also to determine 
statistically the levels needed and relative importance 
of each competency in distinguishing great from 
average performers. 42 A list of competencies that are 
common performance distinguishers for jobs ranging 
from managers to sales personnel appears in Figure 3. 

Accounting for team-based work. In many orga- 
nizations, individuals in various roles work together 
to achieve an important objective. Organizations 
measure both team effectiveness and individuals’ 
own contributions in this context to make good 
decisions — both about team assignments and about 
individual roles. Organization leaders use various 
strategies to measure team outcomes and individual 
employees’ contributions, including peer reviews, sta- 
tistical formulas, and team-based results. 


When individuals work together to 
achieve an important objective, organiza- 
tions measure both team effectiveness and 
individuals’ own contributions to lay the 
groundwork for good decisions. 


>• Peer or 360-degree appraisals (described in more 
detail under Step Five, below) enable organization 
leaders to evaluate each team member’s individual 
contribution to a joint outcome. With feedback 
from peers and other members of a team about 
how an employee contributed to the outcome and 
in what ways, evaluators gain greater insight into 
the strengths, weaknesses, and contributions of 
individuals on the team. 43 Team members are in 
a good position to assess one another’s competen- 


cies, skills, and behaviors because they observe 
them daily. 44 

At Fortune 500 company Texas Instruments, 
organization leaders implemented a peer appraisal 
process to evaluate team-based work. They created 
a questionnaire describing 33 effective team behav- 
iors with three ratings that team members used 
to evaluate one another anonymously on a weekly 
basis. Team members were also asked to rank one 
another’s performance on the team behaviors. The 
organization then used these rankings to inform 
decisions about team assignments and bonus pay. 45 

> Sophisticated statistical formulas can also help 
parse out an individual’s contribution to a joint 
outcome. This method is most often used in 
sports, where the value added by an individual 
player is determined using statistical formulas. 
These compare team performance when an indi- 
vidual plays to team performance when that player 
is not in the game. In addition, since the makeup 
of players on the team can change, these models 
also consider how the value of players’ contribu- 
tions changes depending on who is on the field. 46 

>• Team-based results measurement is also common, 
and almost always used in combination with 
peer or individual reviews. Using this strategy, 
organization leaders hold all employees on a team 
equally accountable for the team’s results. Orga- 
nizations deliver a uniform performance rating 
to all members of a team who share responsibility 
for obtaining results, based on the team’s overall 
achievement of its objectives. 47 

A serious drawback of using team-based results 
exclusively, however, is what researchers Susanne 
Scott and Walter Einstein refer to as “social loaf- 
ing,” which is likely to result in any sector when 
individual effort is not evaluated and recog- 
nized. 48 Without assessment and action to address 
loafing, other team members are likely to hold 
back their efforts as well, leading to a decrease in 
performance among the team overall. To prevent 
this phenomenon, appraisals of team performance 
systems often reflect measures of “what” (such as 
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productivity, sales volume, or customer satisfac- 
tion) and are used in combination with “how” 
assessments of individual performance . 49 Many 
team performance measurement systems, such as 
those used at technology firm Xerox, are devel- 
oped jointly by teams, managers, and customers 
for informal and formal appraisals . 50 

By identifying the key performance measures that 
underlie successful performance in a job — includ- 
ing the “what” and the “how” that contribute to 
success, and the manner in which employees work 
together to achieve it — organizations can define the 
standards by which employees will be measured. In 
other words, they can begin to answer the question, 
“What does excellent performance look like?” In the 
next sections, we examine the process through which 
organizations answer the companion question, “How 
do we know?” by setting standards and designing a 
performance measurement process. 

Step Four: Set Performance Standards 

After determining the objectives that employees will 
be expected to achieve and the measures by which 
their progress will be evaluated, organization lead- 
ers must determine what level of performance on 
each measure — and overall — is expected by setting 
performance standards. In education, setting perfor- 
mance standards is controversial, but in most other 
sectors, it is routine. This step typically includes 
designing performance categories — the scales to use 
for rating each measure — and determining which 
category will serve as the employees’ target, and what 
is superior or unacceptable performance. 

Designing performance categories. Setting 
performance standards requires first defining the 
categories into which various levels of performance 
are grouped. The primary design tasks here are deter- 
mining the number of categories and choosing their 
labels, or descriptions that help ensure consistency 
and accuracy in ratings. 

> Number of Rating Levels. Across sectors, the ma- 
jority of organizations include from three to five 


levels in their performance measurement systems. 
Performance can rarely be meaningfully assessed 
on a pass-fail basis, and it is not often useful to 
distinguish among more than five levels of perfor- 
mance . 51 A 1994 survey of Fortune 100 companies 
found that most (60 percent) used appraisal sys- 
tems with five levels of performance. Twenty per- 
cent of the surveyed organizations used more than 
five, and 20 percent used fewer than five . 52 

There are advantages and disadvantages to 
each number of performance categories within 
the three-to-five range. Fewer categories may 
not allow for fine enough distinction between 
fundamentally different levels of performance, 
including outstanding performers or those who 
cannot improve. But fewer levels can foster greater 
consistency among raters. A larger number of per- 
formance categories make ratings more complex 
and tend to increase the chances that some levels 
will not be used at all. On the other hand, hav- 
ing more categories allows leaders to recognize 
truly excellent performance and to distinguish 
between low performers who can improve and 
those who should leave the organization. One 
particular advantage of an even number of perfor- 
mance categories (most often, four) is elimination 
of the frequently-used midpoint position, or the 
perception of “average” performance . 53 The most 
important quality of the performance categories 
is that they be clearly defined, so that expectations 
are clear among supervisors, ratings are more con- 
sistent, and employees understand the meaning of 
the designation they receive . 54 

Even with a carefully selected number of per- 
formance categories, however, decades of research 
indicate that leniency and rating inflation are 
common problems across all types of industries. 55 
For example, in the survey cited above, among the 
80 percent of companies that used five or more 
levels of performance, raters actually used only the 
highest three levels (see Figure 4). Very few em- 
ployees in these companies were rated in the bot- 
tom two performance levels. The norm, instead, 
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figure 4. Performance Distribution among Fortune 100 Companies 



Far exceeds Exceeds objectives Fully meets Partially meets Unsatisfactory 

objectives objectives objectives 


Executives I Managers Professionals Non-Exempts 


Source: Based on data presented in Steven L. Thomas 8 ^ Robert D. Bretzjr. “Research and Practice in Performance 
Appraisal: Evaluating Employee Performance in America’s Largest Companies.” SAM Advanced Management Journal 
(Spring 1994). 


was a leniency bias, with most employees rated at 
the top end of the scale. 56 

> Category Descriptions. Organizations can over- 
come some of the challenges outlined above by 
defining categories very specifically in terms of 
descriptively scaled behaviors and quantitatively 
defined categories. Quantitative categories can 
be clearly labeled with ranges, so that ratings are 
highly objective and subject to fact-checking. 
Qualitative measures also can be described clearly 
and sometimes transformed into quantitative, 
scaled categories. For example, rather than just 
having customers rate on a scale with such cat- 
egories as “unsatisfied, satisfied, and very satis- 
fied,” ratings can be more specific. On a rating 
of timeliness of checkout at a car rental service, 
the rating could have categories such as “under 
3 minutes; 3-5 minutes; 5-10 minutes; and over 
10 minutes,” making comparisons far more con- 
sistent. Details like these can be correlated with 
other measures so that the organization not only 
has information about individual employees, but 


also can set clearer, more specific targets in the 
future. 

Even the most qualitative measures like be- 
havioral competencies can be broken down into 
scaled levels, with descriptions of behavior that 
distinguish high levels of the competency from 
middle and low, as shown in Figure 5. Scaled mod- 
els enable organization leaders to classify the types 
of behavior they observe among their employees 
into meaningful, consistently scaled measures of 
performance, thus translating qualitative behav- 
iors into quantified data. 57 Managers can then set 
standards for the level of rating on a given compe- 
tency that is required for success. 

Determining performance standards. By clearly 
identifying standards of excellence — both on in- 
dividual measures and on employees’ overall per- 
formance — leaders help align both expectations 
and performance with the organization’s mission. 58 
Across sectors, organizations set standards for em- 
ployee performance in both the “what” and “how” 
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figure 5. Sample Levels of Impact and Influence Competency 


DESCRIPTION BEHAVIORS 


1 

Personal Gain Tactics, 
Limited Persuasion 

Uses negative behaviors for personal gain, or does not act to influence others. 

2 

No Adaptation to Audience 

Prepares and presents data and logical arguments, but does not tailor to make 
them appealing or influential to the specific audience. 

3 

Tailors Single Action to Influence 
Audience 

Thinks ahead about the likely reaction of audience, and adapts communication 
to obtain desired impact. 

4 

Tailors Single, Dramatic Action to 
Influence Audience 

Takes one, dramatic action chosen to obtain a specific reaction from audience. 
(Threats do not count.) 

5 

Tailors Two Actions to Influence 
Audience 

Thinks ahead about the likely reaction of audience, adapts communication 
to obtain desired impact, and shows “influence tenacity” by taking two (not 
necessarily dramatic) steps to influence. 

6 

Tailors Three Actions or Uses Indirect 
Influence 

Takes three or more steps chosen to influence, or uses third-party experts or 
trusted individuals to influence others, or obtains individual support “behind 
the scenes,” or chooses timing and delivery/withholding of information to 
influence. 

7 

Complex Influence 

Engages in a complex set of maneuvers with many people — personal 
communications, use of third parties, promotion decisions, sharing of power 
or information, working through chains of people for a “domino” influence 
effect — to obtain desired impact. 


Source: Lyle Spencer and Signe Spencer, Competence at Work (New York: John Wiley, 1994). 


categories by considering what is necessary to achieve 
the ultimate objectives and what is feasible within 
the constraints of the job. Key issues include decid- 
ing whether to use absolute or relative standards and 
how to account for changing circumstances. 

Standards for output, or “what” measures, for 
example, might include selling 50 cases per week and 
establishing three new customers in the course of a 
month for a salesperson (see Figure 2 above). Other 
examples include the frequency expected of specific 
tasks and a specified time limit to complete a particu- 
lar task. Standards for the “how” of an employee’s 
performance typically describe the type of skill or 
behavior expected in terms of its frequency, quality, 
or completeness. For example, an employer might set 
a target rating on customer satisfaction surveys, or 
identify target levels of behavioral competencies that 
are expected in a job or role. 

In addition to defining performance standards 


for individual objectives, organizations must also 
set standards for employees’ performance overall. 
These standards typically involve a roll-up of ratings 
on individual objectives, with some degree of weight 
assigned to each based on their importance to the 
employee’s and the organization’s success. 59 

> Using relative (versus absolute) targets. The most 
straightforward method of evaluating employees 
is by judging their performance relative to a given 
standard, as described above. 60 Another approach 
to rating employees, however, evaluates their per- 
formance in comparison with one another rather 
than against a set standard. 61 Performance rank- 
ing or “forced distribution” systems have been 
adopted among an increasing number of highly 
successful companies in large part to address the 
challenges of leniency and rating inflation, and to 
help inform decisions about compensation and 
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Performance ranking systems evaluate 
employees in comparison with one 
another rather than against a set standard, 
in part to address common challenges 
of leniency and rating inflation. 


promotion. 62 By some estimates, as many as 20 to 
30 percent of all companies use performance rank- 
ing in some form. 63 

Performance ranking typically involves assign- 
ing performance ratings to managerial and execu- 
tive staff using forced distribution along a curve 
(e.g., 20 percent rated “most effective,” 70 percent 
rated “effective,” and 10 percent rated “needs im- 
provement”). Some methods rank employees in a 
particular work group from best to worst using a 
“totem pole” approach. Other approaches place 
employees or teams into one of a set number of 
performance groups based on a fixed distribution 
(e.g., 23 percent into each of four groups). 64 

General Electric is the best-known and most 
widely cited example of a successful forced rank- 
ing system. Former Chief Executive Jack Welch 
required managers to divide talent into three 
groups — the top 20 percent, middle 70 percent, 
and bottom 10 percent. The top-ranked perform- 
ers were eligible for substantial performance 
awards, and many in the bottom 10 percent were 
shown the door. 65 In recent years, GE has soft- 
ened its approach, continuing to use a curve for 
performance appraisal but using the 20-70-10 
categories as general guidelines to inform pay and 
development decisions, rather than as required 
categories with default consequences. 66 

To better recognize, reward, and retain its top 
talent in highly competitive Silicon Valley, inter- 
net search giant Yahoo recently instituted a totem 
pole approach to inform the allocation of salary 
increases and bonuses. Managers are asked to 


rank employees within each unit and to distribute 
raises and bonuses accordingly. During reviews, 
they also discuss with employees how their raises 
compare with those of their colleagues. 67 

Forced distribution systems are one way to 
overcome the pervasive tendency toward rating 
inflation and reinvigorate performance measure- 
ment that has become a rote exercise. 68 In addi- 
tion, well-designed programs have been shown to 
be a highly effective way of dramatically increasing 
overall organizational results. In 2005, a group of 
university researchers constructed a sophisticated 
mathematical simulation of a forced ranking pro- 
cess across 100 companies. They found that forced 
ranking, including the firing of the bottom 10 per- 
cent of employees per year, resulted in an annual 
16 percent gain in organization-wide productivity 
for the first two years. 69 

Forced distribution programs have many critics, 
however, for their potential to negatively affect em- 
ployee morale, teamwork and collaboration, future 
recruiting, and the opinions of external stakehold- 
ers. 70 Research suggests that the benefits of forced 
distribution drop off after the first two years, from 
6 percent increases in productivity in the third 
and fourth years to essentially zero by year io. 71 
Therefore, despite these systems’ strengths, many 
of even the most ardent supporters recommend 
implementing them for only a few years. 72 

> Accounting for idiosyncrasies. In any work environ- 
ment, there will inevitably be factors outside the 
organization’s and employees’ control that will 
nonetheless affect performance. For example, in 
the sales setting, a salesperson might meet her 
objective of selling 50 cases per week — and thus 
earn a satisfactory performance appraisal. But if, 
due to a shifting marketplace and consumer pref- 
erences, the salesperson’s market had actually ex- 
panded by 40 percent during the appraisal period, 
simply meeting this standard would not actually 
be satisfactory performance. Under the new cir- 
cumstances, the firm might expect 70 cases per 
week instead. 73 
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Similarly, the character of a firm’s neighbor- 
hood, the client base that it serves, or the degree of 
local market competition can change within a per- 
formance appraisal period and affect the results 
that an organization expects from its employees. 
Organizations across sectors rely on a variety of 
methods to take these changes into account when 
setting standards and evaluating employees’ per- 
formance . 74 They do not stop measuring perfor- 
mance or setting standards. They simply change 
the measures and standards to fit new realities. 

For example, in sales and many other settings, 
firms often account for changes and idiosyncra- 
sies by benchmarking how other employees have 
achieved in similar situations, and setting stan- 
dards for their own staff in comparison . 75 Simi- 
larly, some firms set standards for individual per- 
formance that automatically account for external 
conditions, such as changing competitive environ- 
ments or socio-demographic characteristics. Using 
statistical modeling, these organizations estimate 
the results they expect for individuals or depart- 
ments, which then serve as a point of comparison 
by which to evaluate actual performance . 76 

Step Five: Adopt a Performance 
Measurement Process 

With objectives and measures set for each employee, 
standards set, and a rating or ranking process in 
place, the key remaining design issue in a perfor- 
mance measurement system is the process by which 
performance reviews are conducted. Research from 
the fields of psychology, employee engagement, and 
performance management suggests that how per- 
formance appraisals are carried out is at least as im- 
portant to employee and organization success as the 
performance measures upon which they are based . 77 
In this section, we explore the key decisions for orga- 
nization leaders in designing such a process, includ- 
ing determining who will carry out evaluations and 
how often. 

Who conducts evaluations? Organizations take 
different approaches to the performance evaluation 


The process that leaders design to 
carry out performance appraisals is at least 
as important to success as the performance 
objectives they choose to assess. 


process, with the majority assigning responsibility 
for appraisals to immediate supervisors. Many incor- 
porate feedback from other players in the appraisal 
process, however, including peers, subordinates, cus- 
tomers, and employees themselves. 

> Reviews conducted by the direct supervisor. In the 
great majority of organizations, the employee’s 
direct supervisor oversees performance apprais- 
als, including collecting hard data, evaluating the 
employee’s competencies and progress, and con- 
ducting the appraisal discussion . 78 This is largely 
because the direct supervisor typically has the 
most direct contact with and authority over the 
employee, and therefore is likely to best under- 
stand the nature of the employee’s work and over- 
all performance . 79 

While supervisors are the most common 
evaluators and have clear advantages in this role, 
research and experience suggest some potential 
shortcomings to their ratings. For example, re- 
search indicates that manager evaluations can 
be unreliable for a number of reasons, including 
the manager’s desire to improve perceptions of 
performance within his or her own workgroup, 
lack of time or energy to dedicate to the task, and 
misunderstandings about the process or goals of 
performance measurement . 80 In addition, supervi- 
sory ratings have been shown to contribute to the 
leniency and rating bias described above, as man- 
agers seek to avoid uncomfortable confrontations, 
encourage an employee to improve, or move an 
unsuccessful employee into a new work setting . 81 

Research and experience from other sectors 
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suggest two methods for overcoming shortcom- 
ings of supervisors’ appraisals. First, holding man- 
agers accountable for the quality and substance 
of their appraisals can improve the reliability and 
meaningfulness of the performance measurement 
process. 82 In an extensive review of evaluation 
practices among Fortune ioo companies, research- 
ers found that less than 25 percent of managers 
were evaluated on how they conduct performance 


By holding managers accountable for the 
quality and substance of their appraisals, 
organization leaders can improve the 
reliability and meaningfulness of the 
performance measurement process. 


appraisals or award performance ratings. By incor- 
porating the outcomes of appraisals into the man- 
ager’s own evaluations, organization leaders can 


encourage them to assign more accurate ratings 
and invest more in the process. 83 

Providing detailed and frequent training is 
another tactic some experts support to produce 
meaningful, fair, and consistent evaluations, given 
the complexity of the decisions managers must 
make during the performance measurement pro- 
cess. 84 But in fact, evaluators typically receive lim- 
ited training on evaluation processes, with little 
follow-up beyond initial training sessions focused 
on providing feedback, using appraisal forms, or 
avoiding rating errors. 85 For example, a 1991 survey 
of 22 civil engineering firms found that less than 
40 percent of reviewers received training in the 
appraisal process. 86 

>- Reviews conducted by other stakeholders. Many 
organizations across sectors include others in the 
evaluation process in addition to the direct su- 
pervisor, including higher-level supervisors, peers, 
subordinates, and customers. These additional 
perspectives can reduce the likelihood of bias in 
the evaluation process and provide a broader pic- 
ture of the employee’s performance. 87 
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One popular method for incorporating feed- 
back from a broader range of stakeholders is the 
360-degree review, which includes appraisals 
from managers, coworkers, subordinates, and the 
employee to provide a “360-degree” picture of the 
employee’s performance. 88 Many organizations 
have found these reviews to be a critical element 
of their performance measurement process, both 
for providing a fuller view of performance and 
increasing employees’ buy-in into the process. 89 
Microsoft, for example, asks all employees to 
complete appraisal forms about their manager’s 
performance on an annual basis. Questions cover 
the degree to which the manager empowers the 
employee to do his job, provides an environment 
that motivates employees to achieve their goals, 
and recognizes and rewards information and 
creativity. 90 This type of feedback is particularly 
helpful for employee (and manager) development, 
by identifying areas of strength and weakness to 
inform future training or development opportu- 
nities. 91 At Microsoft, the process also serves to 
reinforce core values throughout all levels of the 
organization. 

It is not clear from the research, however, that 
multisource appraisals increase the validity of final 
performance ratings, or that they should be used 
for purposes beyond employee development . 92 
Other colleagues and subordinates often suffer 
from the same leniency and rating bias as direct 
supervisors, frequently inflating ratings to avoid 
confrontations or increase the likelihood that the 
employee will assign them a high performance rat- 

• • 93 

ing in return. 

When multisource feedback is used in final 
performance ratings, reviews from those other 
than the supervisor are typically weighted less 
heavily in the total performance rating. In their 
survey of Fortune 100 companies, Thomas and 
Bretz found that the rating from an employee’s 
immediate supervisor is often weighted most 
heavily (comprising between 42 and 74 percent 
of the total performance rating) followed by the 


figure 6. Relative Importance of Reviewer Feedback 
among Fortune 100 Companies 



Source: Based on data presented in Steven L. Thomas 
^ Robert D. Bretz, Jr. “Research and practice in perfor- 
mance appraisal: evaluating employee performance in 
America’s largest companies.” SAM Advanced Management 
Journal (Spring 1994). 

higher-level supervisor (18 to 36 percent), other 
colleagues (3 to 15 percent) and peers (o to 2 per- 
cent) (see Figure 6). 94 Former General Electric 
CEO Jack Welch, a vocal proponent of 360-de- 
gree reviews, argues that the process is primarily 
helpful when used in the course of evaluations 
every second or third year, to provide an opportu- 
nity for otherwise silent colleagues to offer their 
perspective. 95 

> Reviews conducted by the employee. Some compa- 
nies allow employees to offer a self-evaluation as 
part of the overall appraisal process. Self-reviews 
can provide employees with an opportunity to 
examine their strengths and weaknesses, and to 
discuss areas for growth with a supervisor. In ad- 
dition, it has been suggested that evaluating one’s 
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own effectiveness can enhance the assessment pro- 
cess by increasing buy-in in the process and impact 
of review feedback. 96 

Several studies conducted under a variety of 
research conditions show that self- appraisals are 
especially subject to leniency bias, however, and 
therefore have low validity for objectively assess- 
ing performance. 97 In the survey of Fortune ioo 
companies cited above, companies did not typi- 
cally use ratings resulting from self- appraisals in 
the formal evaluation process, or they assigned 
them very little weight (about i percent) in an em- 
ployee’s final rating. 98 

How frequent are performance appraisals? Across 
sectors, most organizations conduct formal perfor- 
mance evaluations once a year. A 2010 survey of more 
than 500 human resources managers in large and 
midsize companies found that 60 percent conducted 
performance appraisals annually. 99 These typically 
consist of performance feedback, including discus- 
sion of the employee’s strengths and weaknesses, as 
well as management decisions about issues such as 
compensation, promotions, or training. 100 

Many experts advocate for performance measure- 
ment systems that incorporate more than one ap- 
praisal per year, however, particularly for purposes 
of employee feedback and development. In a survey 
of managers in 22 civil engineering firms, research- 
ers identified evaluating employees at least twice a 
year as a “strong point” of the performance appraisal 
process. 101 A 2008 survey of accountants working in 
government, industry, and public accounting found 
that the majority of respondents would prefer one 
more evaluation than they had, for a total, most 
commonly, of two per year. 102 Former GE CEO Jack 
Welch also argues for “rigorous and candid” per- 
formance evaluations at least semiannually to give 
employees an opportunity to know where they stand 
relative to the organization’s standards and their 
colleagues. 103 

The research evidence does not suggest an ideal 
frequency of performance evaluations for any given 
employee or organization. Instead, experience sug- 


gests that multiple criteria should guide the fre- 
quency of performance appraisals, including: 

> The availability of neiv and meaningful employee 
data. New and meaningful data are not always 
available to warrant performance discussions with 
employees more than once or twice per year. 104 
Formal evaluations based on interim collections 
of data could cause both the manager and the em- 
ployee to direct their efforts in shortsighted ways 
or to reward or sanction only short-term perfor- 
mance. On the other hand, sometimes new and 
critical data become available to warrant a formal 
appraisal, and the ideal performance measurement 
system will allow sufficient flexibility for these dis- 
cussions to occur. 105 

> How the appraisal will be used. Decisions about 
the frequency of performance appraisals should 
also be informed by the purposes for which they 
will be used. For example, many human resource 
experts suggest that evaluations used for devel- 
opment purposes should occur more often than 
summative assessments used primarily for pay or 
promotion. More frequent feedback on employees’ 
strengths and challenges can enable them to more 
quickly redirect their behaviors and pursue more 
rapid growth. 106 

> The benefit of the result relative to its costs. Perfor- 
mance evaluations consume valuable resources, 
both in terms of managers’ and employees’ time 
and the organization’s resources. While evidence 
suggests that most managers do not spend enough 
time conducting thorough and meaningful per- 
formance appraisals, researchers estimate that the 
average manager still devotes between four and 
eight hours per employee to the process over the 
course of the year. 107 In addition, the development 
of quantitative and qualitative performance mea- 
sures and tools throughout the process requires 
up-front investment from an organization and 
regular updating and revision. Therefore, apprais- 
als should be used as often as organization leaders 
believe that benefits justify their costs. 108 
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Step Six: Use Measurement Results 
to Take Action 

One of the primary goals of performance measure- 
ment is to enable organization leaders to make fair, 
informed, and meaningful management decisions 
that increase the effectiveness of the organization. A 
strong appraisal system will arm organization leaders 
with data to make good decisions about develop- 
ment, rewards (pay increases and bonuses), career op- 
portunities and advancement, and future recruiting 
and selection. 

In addition, a well-documented process can help 
protect the firm with records of performance when 
dismissals are necessary. Each of these areas warrants 
its own report of best practices. In this section, we 
briefly discuss the prevalence and value of each use 
of performance measurement data and provide ex- 
amples of how they play out in organizations across 
sectors. 

> Development. A key purpose of performance 
appraisals, as suggested under Step One, is to 
provide feedback to employees about their perfor- 
mance and further develop their capacity to per- 
form well . 109 Survey evidence suggests that, across 
sectors, this is among the most common uses of 
performance evaluations . 110 

For example, at Public Service Electric & Gas, 
one of the largest investor-owned utilities in the 
country, managers used performance appraisals to 
initiate discussions with employees about specific 
short- and long-term areas for development. They 
also found, however, that as a byproduct of the 
process, because employees had more regular in- 
sight into their performance, they were able to un- 
derstand the causes of certain challenges on their 
own. Instead of speculating about why customers 
weren’t satisfied with repair services, for example, 
workers and supervisors could look at consumer 
satisfaction metrics, see the reasons for those com- 
plaints, and make a point of improving particular 
aspects of the process in the next quarter . 111 

Appraisal conversations that focus on growth 


A strong performance measurement 
system arms leaders with data to make 
good decisions about employees’ develop- 
ment, rewards, career opportunities and 
dismissal — as well as to inform the organi- 
zation’s future recruiting and selection. 



and development offer a number of benefits for 
employees and the organization as a whole, includ- 
ing providing meaningful opportunities for em- 
ployee engagement, identifying common training 
and development needs across the firm, and set- 
ting future objectives to improve overall success . 112 

Cross-sector research and experience suggest 
that three key ingredients of a performance mea- 
surement system can make the most of these dis- 
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cussions. Primary among these is for the appraisal 
to provide meaningful information about how the 
employee is performing, including specific areas 
for improvement and the basis for concrete next 
steps to develop necessary skills or enhance exist- 
ing ones. 113 To ensure that this type of informa- 
tion is included in the final appraisal, evaluations 
should include multiple sources of qualitative data 
through methods such as 360-degree reviews or 
individual development reviews with a supervi- 
sor. 114 The appraisal process should also provide 
an opportunity for the employee to understand 
what level of improvement is necessary to obtain 
rewards or recognition, such as promotions or 
salary increases, and the time period in which im- 
provements will be expected. 115 

>- Rewards. Along with employee development, 
administering salaries and financial rewards is the 
most common purpose of performance apprais- 
als. 116 In their survey of Fortune 100 companies, 
Thomas and Bretz found that 92 percent used 
information from performance appraisals to de- 
termine merit pay increases for employees. 117 At 
Citibank, for example, branch managers’ final per- 
formance evaluations were linked to annual bonus 
determinations under a very straightforward sys- 
tem. Managers who received a rating “below par” 
received only standard base pay. A “par” rating 
generated a bonus of up to 15 percent of the base 
salary, and an “above par” rating could lead to a 
bonus as high as 30 percent of the manager’s base 
pay. 118 Surveys of high-performing governmental 
organizations suggest that most also link perfor- 
mance appraisals in some way to employees’ pay. 119 
In addition, many organizations that incorporate 
forced distribution into their performance mea- 
surement systems rely upon rankings of employees 
to determine bonuses and pay increases. 120 

Research and experience from across sectors 
suggest that to inform decisions about pay, perfor- 
mance measurement systems should incorporate 
three key features. First, as outlined under Step 
Two, appraisals must be based upon fair measures 


of performance and focus on all aspects of the 
job that are important to the employee’s suc- 
cess. 121 They should also be paired with frequent 
feedback on progress. 122 As a technical matter, 
however, actual conversations about pay increases 
should be conducted separately from those about 
the employee’s development, if possible. Research 
is mixed on the advantages of combining or sepa- 
rating the two, but generally suggests that when 
the two are combined, reflection on past perfor- 
mance and the amount of the performance award 
can eclipse important conversations about oppor- 
tunities for future growth and development. 123 As 
management expert Dick Grote explains, “concen- 
trating the performance appraisal discussion on 
the appraisal itself and assigning the compensa- 
tion discussion to a separate meeting can ensure 
maximum mileage from both.” 124 
>- Dismissal. Organization leaders also rely upon 
performance appraisals to make management 
decisions about demotion and dismissal. 125 When 
organization leaders must make these decisions, 
strong performance measurement systems can 
provide the necessary record and justification to 
support termination or demotion, and protect 
the organization and its managers should the 
decision be challenged in court. 126 Of course, no 
performance measurement system can protect 
an organization from unwarranted terminations 
or those based on illegal grounds such as the em- 
ployee’s race, age, sex, religion, or other protected 
characteristics — nor should they. But when an 
employee’s performance warrants removal or other 
similar action, performance measurement systems 
that include frequent evaluation and documen- 
tation of an employee’s work, fair measures of 
performance, and reasonable opportunity for the 
employee to improve can help uphold the organi- 
zation’s decision. 

Chinese appliance manufacturer Haier, widely 
recognized as having one of the world’s strongest 
performance management systems, regularly 
uses data from performance appraisals to inform 


22 | MEASURING TEACHER SI LEADER PERFORMANCE 


WWW. OPPORTUNITYCULT URE.ORG 


dismissal decisions. Under its forced distribution 
program, Haier annually dismisses the lowest 
io percent of employees through a three-phase 
process. In the first annual or quarterly review in 
which an employee is in the bottom io percent, he 
or she is put “on leave” and sent for job training at 
Haier’s expense. If he or she remains in the bot- 
tom io percent during the second review, the em- 
ployee is required to attend a second round of job 
training, but this time at his or her own expense. 
Thereafter, continued performance in the bottom 
io percent results in automatic dismissal. 127 

> Future Recruiting and Selection. Organizations 
can also use strong performance evaluations to 
correlate objective results data with the behav- 
iors, competencies, and skills of an organization’s 
highest performers — that is, to analyze and 
identify relationships between the “what” and 
“how” objectives outlined in Step Two. Because 
competencies can also be evaluated before a can- 
didate actually performs on the job, information 
about the behaviors and skills of high performers 
enables organizations to recruit and screen new 
candidates based on characteristics that have been 
shown to matter in a particular job. Indeed, hir- 
ing processes based on rigorous assessment of the 
competencies that correlate with strong perfor- 
mance among current employees are one of the 
most cost-effective strategies for improving the 
accuracy of selection. 128 In addition, when a strong 
performance measurement system enables tying 
performance to pay, research indicates that higher- 
quality and higher-performing employees dispro- 
portionately choose to work with the organization 
in the future. 129 

In 2009, researchers used this method to 
validate and refine performance evaluations and 
recruiting in the British Royal Navy. They used a 
competency-based interview method to correlate 
behaviors and skills of officers with the Navy’s 
own appraisals of officers’ performance. The re- 
sults showed four competency clusters associated 
with high performance, which the Navy thereaf- 


ter adopted as the basis for all screening and selec- 
tion of leadership and management candidates. 130 
Many other private sector organizations that 
incorporate competencies into their performance 
measurement process are similarly able to correlate 
competencies with high performance and use the 
results to guide future recruiting and selection. 

Research and evidence from across sectors show that 
a strong performance measurement system enables 
organization leaders to make critical decisions about 
employees, strategies, and firm-wide success. By con- 
sidering each of these six critical steps, leaders in a va- 
riety of organizational settings have built strong per- 
formance measurement systems to foster excellence 
and continuous improvement among their employees 
and increase overall organization success. 
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Applying Cross-Sector Lessons to 
Performance Measurement of Educators 


T he research and experience base on per- 
formance measurement across sectors is 
so rich in large part because organization 
leaders in other industries know that talent — and 
strategic management of that talent — is the key to 
their competitiveness and success. 

Yet there is no sector where talent is more im- 
portant than education. We have a growing body 
of research showing that teachers’ and principals’ 
contributions to student learning vary widely, and 
the differences between the best and worst educa- 
tors dramatically alter students’ lives and future 
chances of success. Measuring educators’ perfor- 
mance accurately — and in ways that enable crucial 
decisions — must therefore be a top priority within 
education as it is in other sectors. Improving perfor- 
mance measurement for teachers and principals is 
not just a step but the very foundation of refocusing 
our public education system on achieving student 
outcomes. 

Fortunately, education leaders have a great deal to 
learn from many other sectors and organizations that 
have led the nation in performance measurement. 
These sectors have dealt with many of the same 
challenges facing education leaders, and they have 
devised, studied, and time-tested solutions to address 
those challenges. We can use these lessons to inform 
performance evaluation reforms for teachers and 
principals, such as developing objectives, measures, 
and standards according to the best practices across 
sectors, and using them to guide management deci- 
sions in education. 

Figure 7 draws from the cross-sector lessons to 
present an example of a performance measurement 
system for teachers. Table 1 offers an example of how 


the levels and behaviors of a specific competency can 
be displayed for performance measurement. These 
are not meant to be prescriptive but are simple il- 
lustrations of how the lessons about performance 
measurement from other sectors can be applied in 
the education setting. 

In addition to these sample models, we offer a few 

additional recommendations for education leaders 
committed to improving performance measure- 
ment for teachers and principals: 

> Measure more to measure better. Too often in 
education, our fear of imperfect measures trumps 
the will to improve evaluation systems at all. 
Districts, schools, and other providers that are 
committed to improving the performance of pro- 
fessionals who enter and stay in education — and 
how much they contribute on the job — will 
measure fearlessly and frequently, and use the 
shortcomings of current systems to spur constant 
improvement. With a simultaneous commitment 
to use the lessons and measurement tools avail- 
able today and improve them in the future, we can 
rapidly advance performance measurement — and 
contribute to improved student outcomes. 

> Measure both the “what” and “how” of educa- 
tors’ performance. Because the education sector 
has such limited evidence about the process skills 
and competencies that contribute to outstanding 
work, many have become distracted by contro- 
versies about evaluations that are based only on 
student results. But the research outlined above 
reveals the clear advantages to including measures 
in evaluations of student results and educators’ 
skills and competencies. Combining both kinds 
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figure 7. Sample Elements of a Performance Measurement System for Teachers 


Goal 


Objectives 


Measures 


Standards 


Process 


Uses 


1 


1 


1 


1 


1 



Prepare students 
for success in 
college, career, 
and life 


Achieve 
significant 
learning gains 
for all 
students 


Demonstrate 
competencies 
correlated with 
positive student 
outcomes 


Student growth 
on state 
assessments 
(all students) 


Progress 
on ratings of 
student work 
(using standard- 
ized rubrics) 


Ratings of com- 
petency levels by 
trained observers 
(see Table 1) 


Growth rate for all 
students, as 
defined by other 
teachers’ rates in 
previous years 





Student growth 
on state 
assessments 
(subgroups) 

Growth rate for all 
student subgroups, 
as defined by other 
teachers’ rates w/ 
similar students 




Student 
achievement 
on state 
assessments 
(% at grade level) 


State proficiency 
targets 




Meet rating targets 
and progress goals 
on each rubric 
element 


Threshold or 
superior levels 
of competence 
correlated with 
target student 
outcomes 


Principal Role: 

evaluation twice 
per year (using 
student work 
review and stu- 
dents’ ratings 
of teacher’s 
competencies 
midyear; state 
assessment 
results and peer 
and observer 
competency 
ratings at year’s 
end) 


• Development 

• Tenure review 

• Promotion 

• Compensation 


Development 
Career paths 


Selection 


Note: Blue boxes describe the objectives, measures and standards related to the “what” of a teacher’s job, while white boxes 
describe the objectives, measures, and standards related to the “how” of their success. Tan boxes apply to both categories. 


of measures will increase the developmental value 
of evaluations and lay the groundwork for better- 
informed decisions about compensation, promo- 
tion, and dismissal. 

> Conquer fears of measuring student learning. 

While measures of skills and competencies have 
a critical role in educators’ evaluations, district 
and state leaders must also consider data about 


teachers’ and leaders’ contributions to student 
progress. After all, schools are in the business of 
learning — and students’ academic growth is the 
primary outcome. Rather than fear student learn- 
ing results, we must measure them — and commit 
to improve our measurement over time, rather 
than using imperfections as an excuse to resist 
such measures entirely. 
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table i. Sample Levels of Achievement Competency for a Teacher 


Zone 

Level 

Description 

Behaviors 

Red Flag 

1 

Low concern for work quality 

Shows little concern for quality of work, or preoccupied by non- 
work matters 

Neutral 

2 

Wishes to do job well 

Expresses desire to do the job well but does not make measurable 
improvements or have a clear standard of excellence 

Threshold 

3 

Moderate concern for work or quality 

Works to do tasks and meet standards required by principal/ 
management or makes voluntary improvements, but with no 
specific goal in mind, or to meet only modest goals 

4 

Strong concern for work goals and 
quality improvement 

Sets challenging but realistic work goals for self, students, or other 
adults and acts to meet them, or sets challenging goals for self and 
students and monitors progress 

Superior 

5 

Prioritizes goals and tasks based on 
impact relative to effort 

Carefully chooses challenging goals and actions towards goals (for 
self and students) based on cost-benefit analysis: time, money, and 
other resources needed, versus the speed and magnitude of results 

6 

Pursues high-risk goals and 
improvement 

Commits significant resources and time to reach a very challenging 
goal without being sure of success, and takes multiple actions to 
minimize risk (e.g., conducting research, anticipating barriers, plan- 
ning ahead, engaging others to help) 


Source: Public Impact (2008). School Turnaround Teachers: Selection Toolkit, with competencies derived from Spencer 
Spencer (1993). Competence at Work; Hobby, Crabtree and Ibbetson (2004). The School Recruitment Handbook, A Guide to 
Attracting, Selecting and Keeping Outstanding Teachers; and Haberman (1995). Star Teachers of Children in Pouerty. 


> If a measure does not predict success, change it. 
Great organizations change their measures, and 
measurement processes, to improve their validity 
and to reflect changes in how work is done. Edu- 
cation must do no less. 

> Meaningfully assess team performance. Many 
employees’ contributions occur largely at the team 
level, and successful organizations across sectors 
have developed careful strategies to assess their 
performance in that context. Methods include 
identifying individual contributions to team re- 
sults using peer evaluation and statistical analysis, 
and assessing team outcomes alongside individual 
results. 

> Adopt strategies to protect against leniency 
and bias. Tactics include rating on a forced dis- 
tribution, holding managers accountable for the 
appraisals they conduct, including reviews from 
multiple perspectives with different weights, and 
providing training in the performance measure- 


ment process. Each of these strategies — including 
forced distribution systems in particular — must 
be designed carefully to offer their full benefit. 

But the alternatives typically suffer from leniency 
that robs educators of valuable feedback and stu- 
dents of the teachers they deserve. 

> Use performance measurement results. Improve- 
ments to our performance measurement systems 
will have no effect unless the results are put to 
use. States, districts, and other providers must 
commit to act upon real differences in educator ef- 
fectiveness — by using evaluations as the basis for 
teachers’ and principals’ ongoing development and 
pay, decisions about retention and dismissal, and 
future recruiting and selection. Altogether, these 
decisions form the culture in our schools — the 
culture that determines who decides to teach, how 
long they stay, what students they reach, and how 
motivated they feel to raise our children’s sights 
and conquer obstacles. 
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